Robust Speech Coding for the Preservation of Speaker Identity

نویسندگان

Mark Phythian

John Leis

Sridha Sridharan

چکیده

Low bitrate speech coding usually requires robustness to a wide range of speakers. The problem which we report on here is one where the compression rate must be maximized for the purposes of archival, but the compressed information must be available at a later date for the purposes of identifying a new speaker. The new speaker may or may not have been recorded in the archived database. As would be expected, the ability to identify a particular speaker when compared to the compressed speech information is impaired, in a manner which is related to the degree of compression. Furthermore, automatic speaker recognition algorithms depend upon a parameterization of the speech which may not be available in the quantity required in the compressed data stream. We present here our results in identifying a speaker using two common methods applied to the data stream resulting from a class of spectral vector compression algorithms. It is shown experimentally that a simpliied, easily-computed distance metric algorithm is somewhat more sensitive to the compression process when compared to a substantially more complex multivariate statistical modelling method.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...

متن کامل

MFCC and its applications in speaker recognition

Speech processing is emerged as one of the important application area of digital signal processing. Various fields for research in speech processing are speech recognition, speaker recognition, speech synthesis, speech coding etc. The objective of automatic speaker recognition is to extract, characterize and recognize the information about speaker identity. Feature extraction is the first step ...

متن کامل

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...

متن کامل

Robust Speaker Recognition in the Presence of Speech Coding Distortion for Remote Access Applications

For wireless remote access security, forensics, border control and surveillance applications, there is an emerging need for biometric speaker recognition systems to be robust to speech coding distortion. This paper examines the robustness issue for three codecs, namely, the ITU-T 6.3 kilobits per second (kb/s) G.723.1, the ITU-T 8 kb/s G.729 and the 12.2 kb/s 3GPP GSM-AMR coder. Both speaker id...

متن کامل

Codebook Design Method for Noise Robust Speaker Identification based on Genetic Algorithm

In this paper, a novel method of designing a codebook for noise robust speaker identification purpose utilizing Genetic Algorithm has been proposed. Wiener filter has been used to remove the background noises from the source speech utterances. Speech features have been extracted using standard speech parameterization method such as LPC, LPCC, RCC, MFCC, ΔMFCC and ΔΔMFCC. For each of these techn...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1996

Robust Speech Coding for the Preservation of Speaker Identity

نویسندگان

چکیده

منابع مشابه

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

MFCC and its applications in speaker recognition

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

Robust Speaker Recognition in the Presence of Speech Coding Distortion for Remote Access Applications

Codebook Design Method for Noise Robust Speaker Identification based on Genetic Algorithm

عنوان ژورنال:

اشتراک گذاری